Kuaishou Open-Sources KAT-V1 Large Model: Significant Improvement in Autonomous Thinking Ability, 40B Version Performance Approaching R1-0528
Kuaishou has open-sourced KAT-V1, an autonomous thinking large model available in two versions: 40B and 200B. The 40B version performs close to DeepSeek-R1, and the 200B version outperforms several flagship models. The model adopts a mixed training paradigm that combines short and long thinking, together with the Step-SRPO reinforcement learning algorithm, so it can automatically adjust its thinking mode to the complexity of the question and avoid overthinking. Built on Qwen2.5-32B, it achieves excellent performance in fields such as science and code through a heterogeneous distillation framework and pre-training on 10 million examples.
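To make the idea of discouraging overthinking concrete, here is a toy sketch of a reward that scores both answer correctness and whether the chosen thinking mode matched the question's difficulty. It is not the Step-SRPO algorithm, whose details the summary above does not give. The `Rollout` fields, the `needs_thinking` label, and `mode_weight` are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Rollout:
    """One sampled response (hypothetical structure, for illustration only)."""
    used_thinking: bool   # did the model produce a long reasoning trace?
    correct: bool         # was the final answer right?
    needs_thinking: bool  # assumed label: does the question warrant long reasoning?


def toy_reward(r: Rollout, mode_weight: float = 0.5) -> float:
    """Toy reward: 1.0 for a correct answer, plus a bonus when the
    thinking mode matches the assumed difficulty label."""
    answer_reward = 1.0 if r.correct else 0.0
    mode_reward = 1.0 if r.used_thinking == r.needs_thinking else 0.0
    return answer_reward + mode_weight * mode_reward


# Easy question answered correctly, with and without a long reasoning trace.
direct = toy_reward(Rollout(used_thinking=False, correct=True, needs_thinking=False))
overthought = toy_reward(Rollout(used_thinking=True, correct=True, needs_thinking=False))
print(direct, overthought)
```

Under this toy reward, a correct direct answer to an easy question scores higher than an equally correct answer that spent a long reasoning trace on it, which is the kind of signal that could push a policy away from overthinking.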